MIRSOFT: mediator for integrating and reconciling sources using ontological functional dependencies

نویسندگان

  • Abdelghani Bakhtouchi
  • Ladjel Bellatreche
  • Stéphane Jean
  • Yamine Aït Ameur
چکیده

Providing automatic integration solutions is the key to the success of applications managing massive amounts of data. Two main problems stand out in the major studies: i the management of the source heterogeneity ii the reconciliation of query results. To tackle the first problem, formal ontologies are used to explicit the semantic of data. The reconciliation problem consists in deciding whether different identifiers refer to the same instance. Two main trends emerge in the reconciliation process: i the assumption that different source entities representing the same concept have the same key – a strong hypothesis that violates the autonomy of sources. ii The use of statistical methods that identify affinities between concepts – not suitable for sensitive-applications. In this paper, we propose a methodology integrating sources referencing shared domain ontology enriched with functional dependencies (FD). Copyright © 2012 Inderscience Enterprises Ltd. MIRSOFT: mediator for integrating and reconciling sources 73 The presence of FD gives more autonomy to sources when choosing their primary keys and allows deriving a reconciliation key for a given query. The methodology is then validated using LUBM.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Uncertain Data Integration Using Functional Dependencies

Data integration systems are crucial for applications that need to provide a uniform interface to a set of autonomous and heterogeneous data sources. However, setting up a full data integration system for many application contexts, e.g. web and scientific data management, requires significant human effort which prevents it from being really scalable. In this paper, we propose IFD (Integration b...

متن کامل

Pay-As-You-Go Data Integration Using Functional Dependencies

Setting up a full data integration system for many application contexts, e.g. web and scientific data management, requires significant human effort which prevents it from being really scalable. In this paper, we propose IFD (Integration based on Functional Dependencies), a pay-as-you-go data integration system that allows integrating a given set of data sources, as well as incrementally integra...

متن کامل

Extensible Ontological Modeling Framefork for Subject Mediation

An approach for extensible ontological model construction in a mediation environment intended for heterogeneous information sources integration in various subject domains is presented. A mediator ontological language (MOL) may depend on a subject domain and is to be defined at the mediator consolidation phase. On the other hand, for different information sources different ontological models (la...

متن کامل

Ontology Functional Dependencies

We extend traditional functional dependencies (FDs) for data quality purposes to accommodate ontological variations in the attribute values. We begin by formally defining a novel class of dependencies called ontological FDs, which strictly generalize traditional FDs by allowing differences controlled by an ontology database. The ontology databases contain information about synonyms. We then foc...

متن کامل

Reconciling Inconsistent Data in Probabilistic XML Data Integration

The problem of dealing with inconsistent data while integrating XML data from different sources is an important task, necessary to improve data integration quality. Typically, in order to remove inconsistencies, i.e. conflicts between data, data cleaning (or repairing) procedures are applied. In this paper, we present a probabilistic XML data integration setting. A probability is assigned to ea...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IJWGS

دوره 8  شماره 

صفحات  -

تاریخ انتشار 2012